A Novel Stroke Width Based Binarization Method to Handle Closely Spaced Thick Characters
Abstract
Signboards and billboards pose a challenge to image segmentation methods, since these images may contain pictures and graphical objects in addition to text. Methods that often succeed in more traditional text block segmentation do not perform well here, since estimation of text lines, character widths, etc. fails due to the small sample sizes. Further, extraction of characters of different font sizes, which occur in real-world and signboard images, remains a problem. In this paper, as a solution to this problem, we propose two stroke width based binarization approaches. These approaches can be used to eliminate extraneous objects based on estimates of stroke width. We compare our methods with several other stroke width based binarization methods. We observe that the previous approaches fail when there are closely spaced thick characters. We show that our second approach is able to extract closely spaced thick characters better than
Similar resources
Shape based local thresholding for binarization of document images
This paper presents a novel local threshold algorithm for the binarization of document images. Stroke width of handwritten and printed characters in documents is utilized as the shape feature. As a result, in addition to the intensity analysis, the proposed algorithm introduces the stroke width as shape information into local thresholding. Experimental results for both synthetic and practical d...
Stroke Width-Based Contrast Feature for Document Image Binarization
Automatic segmentation of foreground text from the background in degraded document images is very much essential for the smooth reading of the document content and recognition tasks by machine. In this paper, we present a novel approach to the binarization of degraded document images. The proposed method uses a new local contrast feature extracted based on the stroke width of text. First, a pre...
A Binarization Method for Degraded Document Images with Morphological Operations
In this paper, we propose an effective binarization method for degraded document images. This method employs morphological operations throughout its algorithm to suppress uneven illumination in the background region, to detect the character location and to reconstruct text regions. Moreover, a technique for estimating stroke width of characters is introduced to remove noise in a...
Uniqueness of bilevel image degradations
Two major degradations, edge displacement and corner erosion, change the appearance of bilevel images. The displacement of an edge determines stroke width, and the erosion of a corner affects crispness. These degradations are functions of the system parameters: the point spread function (PSF) width and functional form, and the binarization threshold. Changing each of these parameters will affect a...
Evolution maps and applications
Common tasks in document analysis, such as binarization, line extraction etc., are still considered difficult for highly degraded text documents. Having reliable fundamental information regarding the characters of the document, such as the distribution of character dimensions and stroke width, can significantly improve the performance of these tasks. We introduce a novel perspective of the imag...